Maintenance of Generalized Association Rules Based on Pre-large Concepts

نویسنده

  • TZUNG-PEI HONG
چکیده

Due to the increasing use of very large databases and data warehouses, mining useful information and helpful knowledge from transactions is evolving into an important research area. In the past, researchers usually assumed databases were static and items were on a single level to simplify data mining problems. Thus, most of algorithms proposed focused on a single level, and did not utilize previously mined information in incrementally growing databases. Items in real world applications are, however, commonly with taxonomy. This paper thus proposes a maintenance algorithm for generalized association rules with taxonomy based on the concept of pre-large itemset. A pre-large itemset is not truly large, but promises to be large in the future. A lower and an upper support threshold are used to realize this concept. The two user-specified upper and lower support thresholds make the pre-large itemsets act as a gap to avoid small itemsets becoming large in the updated database when new transactions are inserted. The proposed algorithm doesn't need to rescan the original database until a number of transactions have been newly inserted. If the database has grown larger, then the number of new transactions allowed will be larger too. Key-Words: data mining, generalized association rule, taxonomy, large itemset, pre-large itemset.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Maintenance of Generalized Association Rules for Record Deletion Based on the Pre-Large Concept

In the past, we proposed an incremental mining algorithm for maintenance of generalized association rules as new transactions were inserted. Deletion of records in databases is, however, commonly seen in real-world applications. In this paper, we thus attempt to extend our previous approach to solve this issue. The proposed algorithm maintains generalized association rules based on the concept ...

متن کامل

An Efficient Maintenance Algorithm of Generalized Association Rules for Record Modification

In this paper, an algorithm is proposed based on the concept of pre-large itemsets to maintain discovered generalized association rules for record modification. A pre-large itemset is not truly large, but promises to be large in the future. A lower and an upper support threshold are used to realize this concept. The two user-specified support thresholds make the pre-large itemsets act as a gap ...

متن کامل

Maintenance of prelarge trees for data mining with modified records

The frequent pattern tree (FP-tree) is an efficient data structure for association-rule mining without generation of candidate itemsets. It is used to compress a database into a tree structure which stores only large items. When data are modified, it, however, needs to process all transactions in a batch way. In the past, the prelarge-tree structure was proposed to incrementally mine associatio...

متن کامل

Association rule mining application to diagnose smart power distribution system outage root cause

Smart grid has been introduced to address power distribution system challenges. In conventional power distribution systems, when a power outage happens, the maintenance team tries to find the outage cause and mitigate it. After this, some information is documented in a dataset called the outage dataset. If the team can estimate the outage cause before searching for it, the restoration time will...

متن کامل

Knowledge-based association rule mining using AND-OR taxonomies

We introduce a knowledge-based approach to mine generalized association rules which is sound and interactive. Proposed mining is sound because our scheme uses knowledge for mining for only those concepts that are of interest to the user. It is interactive because we provide a user controllable parameter with the help of which user can interactively mine. For this, we use a taxonomy based on fun...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004